Threshold for Positional Weight Matrix

نویسندگان

  • Yunlian Pan
  • Sieu Phan
چکیده

weight matrix (PWM) is often used to search for putative transcription factor binding sites. A set of experimentally verified oligonucleotides known to be functional motifs are collected and aligned. The frequency of each nucleotide A, C, G, or T at each column of the alignment is calculated in the matrix. Once a PWM is constructed, it can be used to search from a nucleotide sequence for subsequences that can possibly perform the same function. The match between a subsequence and a PWM is usually described by a score function, which measures the closeness of the subsequence to the PWM as compared with the given background. Nevertheless, the score function is usually motif-length-dependent and thus there is no universally applicable threshold. In this paper, we propose an alternative scoring index (G) varying from zero, where the subsequence is not much different from the background, to one, where the subsequence fits best to the PWM. We also propose a measure evaluating the statistical expectation at each G index. We investigated the PWMs from the TRANSFAC and found that the statistical expectation is significantly (p<0.0001) correlated with both the length of the PWMs and the threshold G value. We applied this method to two PWMs (GCN4_C and ROX1_Q6) of yeast transcription factor binding sites and two PWMs (HIC1-02, HIC1_03) of the human tumor suppressor (HIC-1) binding sites from the TRANSFAC database. Finally, our method compares favorably with the broadly used Match method. The results indicate that our method is more flexible and can provide better confidence.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Estimating the Parameters for Linking Unstandardized References with the Matrix Comparator

This paper discusses recent research on methods for estimating configuration parameters for the Matrix Comparator used for linking unstandardized or heterogeneously standardized references. The matrix comparator computes the aggregate similarity between the tokens (words) in a pair of references. The two most critical parameters for the matrix comparator for obtaining the best linking results a...

متن کامل

Determination of the electrical percolation threshold of Polystyrene-Graphene Oxide nanocomposite using the experiment and simulation methods

Carbon nanostructures via adding to a polymer matrix, while improving the electrical, mechanical and optical properties of the nanocomposites, are widely used in the industry, medicine, and agriculture. The authors presented several investigations on a new type of gamma dosimeter based on the polymer-carbon nanostructures nanocomposite. In this research, the electrical percolation threshold of ...

متن کامل

Local indicators of geocoding accuracy (LIGA): theory and application

BACKGROUND Although sources of positional error in geographic locations (e.g. geocoding error) used for describing and modeling spatial patterns are widely acknowledged, research on how such error impacts the statistical results has been limited. In this paper we explore techniques for quantifying the perturbability of spatial weights to different specifications of positional error. RESULTS W...

متن کامل

ANFIS-based Fuzzy Systems for Searching DNA-Protein Binding Sites

Transcriptional regulation mainly controls how genes are expressed and how cells behave based on the transcription factor (TF) proteins that bind upstream of the transcription start sites (TSSs) of genes. These TF DNA binding sites (TFBSs) are usually short (5-15 base pairs) and degenerate (some positions can have multiple possible alternatives). Traditionally, computational methods scan DNA se...

متن کامل

Positional noise in Landolt-C stimuli reduces spatial resolution: A study with younger and older observers

In the present study we examined the effect of positional noise on spatial resolution in younger and older observers. We used a yes/no discrimination task in which observers indicated whether the size of two gaps in a Landolt-C-like contour was the same or not. The proportion of trials observers perceived one gap larger was measured when gaps-position was fixed (low positional noise) and random...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Engineering Letters

دوره 16  شماره 

صفحات  -

تاریخ انتشار 2008